Field Of The Invention

[0001] The invention relates to a method and a device, in which
noise filtering is applied. The invention further applies to a
video system.



Description Of The Related Art 

[0002] There is presently an increasing interest in digital
transmission of image sequences, e.g., through the Internet.
Especially in the consumer electronics area, the sources of these
images, such as video cameras, video recorders, satellite
receivers and others, are affected by various types of noise. In
particular, in the case of CCD and CMOS cameras, the sensor noise
is usually modeled as white Gaussian, whereas vertical or
horizontal streaks may be found in video scanned from motion
picture films or played by a video recorder, respectively. Before
storage and/or transmission, it is advisable to reduce the noise
level in the images, both to improve the visual appearance and to
reduce the bit rate. Various algorithms are known in the art for
the attenuation of noise having different distributions. These
algorithms are generally either very complex and, consequently,
not amenable to real-time implementation in consumer equipment,
or provide poor performance, typically introducing artifacts and
smoothing edges.

SUMMARY OF THE INVENTION 
[0003] An object of the invention is to provide less complex
noise reduction. To this end, the invention provides a method of
and a device for noise filtering, and a video system.

[0004] In a first embodiment of the invention, a type of noise
in the signal is estimated, and one of at least two noise filters
is enabled, the enabled noise filter being the one most suitable
for the estimated type of noise. The invention is based on the
insight that estimating a type of noise and automatically enabling
one filter out of a set of simple filters, each favorable to a
specific noise type, is more effective than a complex filter which
has to cope with different noise characteristics. Both the noise
type estimation and the filters have a low complexity and are
suitable for low-cost applications.

[0005] Edge preserving noise reduction can be achieved using
spatio-temporal rational and median-based filters. A rational
filter is a filter described by a rational function, e.g., the
ratio of two polynomials in input variables. It is well known that
spatio-temporal rational filters can effectively distinguish
between details and homogeneous regions by modulating their overall
low-pass behavior according to the differences of suitably chosen
pixels (see Ref. [1]), so that noise is significantly reduced while
details are not blurred. They are effective on various types of
noise, including Gaussian noise (Ref. [1]) and contaminated
Gaussian noise (Ref. [2]). Contaminated Gaussian noise has a
probability distribution according to:

    [equation not legible in source]                          (1)

wherein $\lambda$ is a parameter and $N(\sigma)$ is a Gaussian
distribution with variance $\sigma$. A variance of the contaminated
Gaussian distribution is given by:

    [equation not legible in source]                          (2)

In case of long-tailed noise, a simple median filter (Ref. [3]) is
used, which is effective both for single noisy pixels and for
horizontal and vertical streaks, so that there is no need to
distinguish between ideal and real impulsive noise. Median-based
operators are very efficient in case of long-tailed noise,
especially impulsive noise, while their use in case of Gaussian
noise is not advisable, because they tend to generate streaking and
blotching artifacts.

[0006] A further embodiment of the invention uses a simple
algorithm to estimate the type of noise in the image sequence. This
embodiment uses a kurtosis of the noise as a metric for the type of
noise. The kurtosis is defined as (Ref. [4]):

    $$k = \frac{\mu_4}{\sigma^4} \qquad (3)$$

where $\mu_4$ is a fourth central moment of the data, and
$\sigma^2$ is a variance of the data in the image sequence. The
fourth central moment is given by:

    $$\mu_4 = E\left[(x - \bar{x})^4\right] \qquad (4)$$

where E is an expectation of a variable and $E(x) = \bar{x}$. The
fourth central moment $\mu_4$ is related to the peakedness of a
single-peaked distribution. The kurtosis is dimensionless with
k = 3 for a Gaussian distribution. A kurtosis value of 3 therefore
means that the noise distribution has, in some sense, the same
degree of peakedness as a member of the normal family. Further,
k > 3 for contaminated Gaussian noise, and k >> 3 for impulsive
noise.
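
As a rough illustration of the kurtosis metric, the following Python
sketch estimates k from synthetic samples. The helper name `kurtosis`,
the seed, the sample count and the noise levels are assumptions chosen
for illustration only; they are not taken from the described
embodiment.

```python
import random

def kurtosis(samples):
    """Return k = mu_4 / sigma^4 for a sequence of numbers."""
    n = len(samples)
    mean = sum(samples) / n
    # Second and fourth central moments (population form, dividing by n).
    var = sum((s - mean) ** 2 for s in samples) / n
    mu4 = sum((s - mean) ** 4 for s in samples) / n
    return mu4 / var ** 2

rng = random.Random(0)  # fixed seed so that the run is repeatable

# Gaussian noise with an assumed standard deviation of 10.
gaussian = [rng.gauss(0.0, 10.0) for _ in range(100_000)]

# Impulsive noise: an assumed 2% of the samples are +/-255, the rest zero.
impulsive = [rng.choice((-255.0, 255.0)) if rng.random() < 0.02 else 0.0
             for _ in range(100_000)]

k_gauss = kurtosis(gaussian)
k_impulse = kurtosis(impulsive)
```

Under these assumptions, `k_gauss` comes out close to 3. For the
impulsive samples, where a fraction p of the values is non-zero with
equal magnitude, k is approximately 1/p, i.e., far above 3.
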
[0007] Prior art operators which are able to distinguish among
several types of noise are very complex. For example, in Ref. [5],
a block-based, non-linear filtering technique based on Singular
Value Decomposition, which employs an efficient method of
estimating noise power from input data, is presented. However, a
hypothesis of additive noise is required, and only Gaussian
distributions are used. In Ref. [6], in order to detect and
estimate both deterministic and random Gaussian signals in
non-Gaussian noise, the covariance of the latter is determined
using higher order cumulants. The inverse problem is treated in
Ref. [7], where signal detection and classification in the presence
of additive Gaussian noise is performed using higher-order
statistics.

[0008] The input signal x is formed by an original noise-free
signal y and a noise signal n according to: x = y + n. In a further
embodiment of the invention, the noise n is approximated by
computing a difference between the signal x and the same signal
after noise filtering, preferably in a median filter (Ref. [8]). A
median of N numerical values is found by taking the middle value in
an array of the N numerical values sorted in increasing order. A
median filter may also be referred to as a non-linear shot noise
filter, which maintains high frequencies. Due to the well-known
noise reduction and edge preserving properties of the median
filter, the resulting signal, z = x - median(x), is composed
approximately of noise only, i.e., z ≈ n. The kurtosis k is then
estimated on z to provide an indication of the type of noise.
Although z does not coincide with the original noise n, for
reasonable values of the noise variance (in case of Gaussian noise
or contaminated Gaussian noise) or of the percentage of noisy
pixels (in case of impulsive noise), the values of the parameter k
for Gaussian, contaminated Gaussian and long-tailed noise do not
overlap, so that the various noise types can be discriminated
correctly using two thresholds, namely 6 and 15.
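
The approximation z = x - median(x) may be sketched in Python as
follows. The 3 by 3 square median mask, the synthetic ramp image, the
noise levels and all function names are assumptions made for this
illustration; the text above does not fix the mask of the median
filter used in the discriminator.

```python
import random

def kurtosis(samples):
    """k = mu_4 / sigma^4, using population (divide-by-n) moments."""
    n = len(samples)
    mean = sum(samples) / n
    var = sum((s - mean) ** 2 for s in samples) / n
    mu4 = sum((s - mean) ** 4 for s in samples) / n
    return mu4 / var ** 2

def median_residual(img):
    """Return z = x - median(x) for all interior pixels (3x3 square mask)."""
    z = []
    for r in range(1, len(img) - 1):
        for c in range(1, len(img[0]) - 1):
            window = sorted(img[rr][cc]
                            for rr in (r - 1, r, r + 1)
                            for cc in (c - 1, c, c + 1))
            z.append(img[r][c] - window[4])  # window[4]: median of 9 values
    return z

rng = random.Random(1)
SIZE = 64
# Noise-free test image y: a gentle diagonal ramp (hypothetical content).
clean = [[60.0 + r + c for c in range(SIZE)] for r in range(SIZE)]

# x = y + n with Gaussian noise of an assumed standard deviation of 10.
gauss_img = [[v + rng.gauss(0.0, 10.0) for v in row] for row in clean]

# Impulsive noise: an assumed 2% of the pixels are replaced by 0 or 255.
impulse_img = [[rng.choice((0.0, 255.0)) if rng.random() < 0.02 else v
                for v in row] for row in clean]

k_gauss = kurtosis(median_residual(gauss_img))
k_impulse = kurtosis(median_residual(impulse_img))
```

With these synthetic inputs, the kurtosis of the Gaussian residual
stays below the lower threshold of 6, while that of the impulsive
residual lies well above the upper threshold of 15.
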
[0009] Preferably, because the noise is assumed to be spatially
uniform, only a small part of each image (e.g., a sub-image of 3 by
3 pixels) is analyzed, in order to keep the computational load per
image low. Because a stable estimate is needed, the analysis is
preferably performed by accumulating data over a plurality of
images before actually computing k. An estimate over 900 pixels
(i.e., over 100 frames) has a reasonably low variance.
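
One possible way of accumulating data over several images before
computing k is to keep running power sums, as in the Python sketch
below. The class name `KurtosisAccumulator` and the simulated residual
samples are hypothetical; in practice, the nine samples per frame
would be the values of z in the analyzed sub-image.

```python
import random

class KurtosisAccumulator:
    """Accumulates power sums of residual samples across frames."""

    def __init__(self):
        self.count = 0
        self.s1 = self.s2 = self.s3 = self.s4 = 0.0

    def add_frame(self, samples):
        for z in samples:
            self.count += 1
            self.s1 += z
            self.s2 += z * z
            self.s3 += z ** 3
            self.s4 += z ** 4

    def kurtosis(self):
        n = self.count
        m1, m2, m3, m4 = self.s1 / n, self.s2 / n, self.s3 / n, self.s4 / n
        var = m2 - m1 * m1
        # Fourth central moment expressed through the raw moments.
        mu4 = m4 - 4.0 * m1 * m3 + 6.0 * m1 * m1 * m2 - 3.0 * m1 ** 4
        return mu4 / (var * var)

rng = random.Random(2)
acc = KurtosisAccumulator()
all_samples = []                     # kept only to cross-check the result
for _ in range(100):                 # 100 frames
    frame_residual = [rng.gauss(0.0, 10.0) for _ in range(9)]  # 3x3 block
    acc.add_frame(frame_residual)
    all_samples.extend(frame_residual)

k_estimate = acc.kurtosis()          # based on 900 samples
```

Only five numbers need to be stored between frames, so the memory and
the computational load per image stay small.
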

[0010] The aforementioned and other aspects of the invention
will be apparent from and elucidated with reference to the
embodiments described hereinafter.

BRIEF DESCRIPTION OF THE DRAWINGS 
[0011] In the drawings: 

[0012] Fig. 1 shows an embodiment of a video system according to
the invention;

[0013] Figs. 2A-2D show exemplary spatial directions considered
in the filters: Fig. 2A: horizontal, Fig. 2B: vertical, Figs. 2C
and 2D: diagonal;

[0014] Fig. 3 shows an exemplary direction used by a temporal
part of a rational filter for Gaussian noise; and

[0015] Fig. 4 shows an exemplary combination of directions used
by a temporal part of a rational filter for contaminated Gaussian
noise.

[0016] The drawings only show those elements that are necessary
to understand the invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS



[0017] Fig. 1 shows an embodiment of a video system 1 according
to the invention. The video system 1 comprises an input unit 2,
such as a camera or an antenna, for obtaining an image sequence x.
The video system 1 further comprises a noise filter 3. The noise
filter 3 comprises a noise discriminator 30 for estimating a type
of noise in the image sequence x. The noise discriminator 30
controls a set of filters 31. Depending on the estimated type of
noise, the most suitable filter in the set of filters 31 is
enabled.

[0018] The noise discriminator 30 comprises a median filter 301,
a subtractor 302 and a noise type estimator 303. The median filter
301 filters the input signal x to obtain a filtered version of x,
being median(x). The filtered signal median(x) is subtracted from
the input signal x, resulting in an approximation of the noise n in
the input signal x, the approximation given by: z = x - median(x).
The signal z is furnished to the noise type estimator 303 for
estimating the type of noise. As described above, the noise type
estimator 303 computes the kurtosis k of the noise signal z. The
noise type estimator 303 furnishes a control signal, which depends
on the kurtosis (and thus on the noise type), to the set of filters
31. Depending on the control signal from the noise type estimator
303, one of the filters in the set of filters 31 is enabled. The
output y of the noise filter 3 may be transmitted to a receiver or
stored on a storage medium.

[0019] In a preferred embodiment, the set of filters 31
comprises three different filters 310, 311, 312 in order to be able
to treat different types of noise. Their operation is automatically
controlled by the noise discriminator 30 as described above.
Preferably, their support is restricted to two temporally adjacent
images only, to keep the computational complexity low. The use of
only two images has the further advantage that the amount of
required image memory is lower than in methods that use more
images. In this embodiment, the filter 310 is suitable for Gaussian
noise, the filter 311 is suitable for contaminated Gaussian noise,
and the filter 312 is suitable for long-tailed noise.
[0020] The filters 310, 311 for the Gaussian noise and the
contaminated Gaussian noise are preferably spatio-temporal rational
filters having a similar structure, constituted by the sum of a
spatial and a temporal filtering part. Each filter output $y_0$ is
computed as:

    $$y_0 = x_0 - f_{spatial} - f_{temp} \qquad (5)$$

    $$f_{spatial} = \sum_{i,j \in I} \frac{-x_i + 2x_0 - x_j}{k_s (x_i - x_j)^2 + A_s} \qquad (6)$$

where $x_0$, $x_i$ and $x_j$ are pixel values within a mask ($x_0$
being the central one), $i, j \in I$ describe a set of spatial
filtering directions shown in Figs. 2A-2D, and $k_s$ and $A_s$ are
suitable filter parameters. The temporal filtering part $f_{temp}$
has a similar form, although $f_{temp}$ operates also on pixels of
a previous image, and is described below. It may be seen that the
spatial filter is able to distinguish between homogeneous and
detailed regions in order to reduce noise while maintaining the
image details. In fact, if the mask lies in a homogeneous region,
the pixel differences $(x_i - x_j)^2$ which appear in the
denominator are small, and the high-pass component present in the
numerator, which is subtracted from $x_0$, gives an overall
low-pass behavior. In turn, if the same differences have a large
value, an edge is assumed to be present, and the filter leaves the
pixel essentially unchanged in order not to blur the detail.
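
The detail-sensitive behavior of the spatial part may be illustrated
with the following Python sketch for a single pixel. It assumes that
the high-pass numerator has the form 2*x0 - xi - xj, that the four
directions of Figs. 2A-2D are used, and that k_s = 0.1 and A_s = 16;
these values and the function name `spatial_part` are hypothetical,
and the temporal part is omitted.

```python
# Offsets (row, col) of the two opposite neighbours for each direction:
# horizontal, vertical and the two diagonals (cf. Figs. 2A-2D).
DIRECTIONS = (
    ((0, -1), (0, 1)),
    ((-1, 0), (1, 0)),
    ((-1, -1), (1, 1)),
    ((-1, 1), (1, -1)),
)

def spatial_part(img, r, c, k_s=0.1, a_s=16.0):
    """f_spatial at interior pixel (r, c); k_s and a_s are assumed values."""
    x0 = img[r][c]
    total = 0.0
    for (dr1, dc1), (dr2, dc2) in DIRECTIONS:
        xi = img[r + dr1][c + dc1]
        xj = img[r + dr2][c + dc2]
        high_pass = 2.0 * x0 - xi - xj           # assumed numerator
        total += high_pass / (k_s * (xi - xj) ** 2 + a_s)
    return total

# Homogeneous region with one noisy centre pixel.
flat = [[100.0] * 3 for _ in range(3)]
flat[1][1] = 110.0
y_flat = flat[1][1] - spatial_part(flat, 1, 1)   # temporal part omitted

# Vertical edge: left column dark, centre and right columns bright.
edge = [[50.0, 200.0, 200.0] for _ in range(3)]
y_edge = edge[1][1] - spatial_part(edge, 1, 1)
```

In the homogeneous case the centre pixel is pulled halfway towards its
neighbours (from 110 to 105 with these parameters), whereas on the
edge the large differences in the denominator leave the pixel almost
unchanged.
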

[0021] The temporal part exploits the same principle of
detail-sensitive behavior, and for Gaussian noise, the form is
similar to that of the spatial part:

    $$f_{temp}^{Gauss} = \sum_{i \in J} \frac{-x_i^p + x_0}{k_{t1} (x_i^p - x_0)^2 + A_{t1}} \qquad (7)$$

where $i \in J$ describes a set of temporal filtering directions as
shown in Fig. 3. In Fig. 3, only one of 9 possible directions
(according to the possible positions of $x_i^p$) has been drawn for
the sake of clarity. The superscript p refers to pixels belonging
to a previous image, and $k_{t1}$ and $A_{t1}$ are suitable filter
parameters.
[0022] The situation is slightly more complicated for
contaminated Gaussian noise. In this case, details and noise are
more difficult to discriminate, because the pixel noise level can
be large (due to the rather long tails of the distribution), and
less information is available than in the spatial case. More
precisely, due to the limited temporal size of the filter support
(only two images), pixels are available only at one (temporal) side
of $x_0$ (conversely, in the spatial part of the filter 311, pixels
both to the right and to the left of $x_0$, or both above and below
it, are available), so that the simple denominator of the spatial
part does not make it possible to distinguish between a single
noisy pixel and the edge of an object. For contaminated Gaussian
noise, $f_{temp}$ is defined as:

    $$f_{temp}^{cont.\,Gauss} = \sum_{i \in J} \frac{-x_i^p + x_0}{\left[k_{t2} (x_i^p - x_0)^2 + k_{t3} (x_i^p - x_i)^2\right]/2 + A_{t2}} \qquad (8)$$

where $i \in J$ describes a set of temporal filtering combinations
(a combination of a temporal direction with a spatial direction) as
shown in Fig. 4, and where $k_{t2}$, $k_{t3}$ and $A_{t2}$ are
suitable filter parameters. In Fig. 4, only one combination of
$x_i^p$ and $x_i$ of a plurality of possible combinations has been
drawn for the sake of clarity. In this case, the pixels in the
denominator, which controls the strength of the low-pass action,
are three instead of two, i.e., $x_i$, $x_i^p$ and $x_0$. In fact,
as already mentioned above, it is not advisable to use the same
control strategy as for Gaussian noise: the difference
$(x_i^p - x_0)$ may be large due to a noise peak instead of an
edge, with consequent loss of the noise filtering action. In turn,
if the same difference is corrected by averaging with another
difference, i.e., $(x_i^p - x_i)$, the denominator remains low also
in the presence of isolated noisy pixels, and the desired low-pass
behavior is obtained.

[0023] Although the filters 310 and 311 are shown in Fig. 1 as
separate filters, in a practical embodiment, the filters 310 and
311 are combined in one rational filter with a common spatial part
and different temporal parts, a first temporal part for Gaussian
noise and a second temporal part for contaminated Gaussian noise.
Depending on the type of noise estimated in the noise discriminator
30, the suitable temporal part is enabled. In a further practical
embodiment, the first temporal part and the second temporal part
are implemented as one temporal filtering part according to
equation (8), wherein, in case the noise has a Gaussian
distribution, the parameter $k_{t3}$ is set to zero to obtain a
rational filter according to equation (7).
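
The combined temporal part may be sketched in Python as shown below.
The sketch assumes a numerator of the form x0 - xp for both temporal
parts, leaves the pairing of previous-image and current-image pixels
to the caller, and uses hypothetical parameter values; the function
names are likewise illustrative.

```python
def temporal_part(x0, pairs, k_t2, k_t3, a_t2):
    """Temporal term in the form of equation (8).

    pairs holds (xp_i, x_i) tuples: a previous-image pixel and the
    current-image pixel combined with it.
    """
    total = 0.0
    for xp, xi in pairs:
        control = (k_t2 * (xp - x0) ** 2 + k_t3 * (xp - xi) ** 2) / 2.0
        total += (x0 - xp) / (control + a_t2)    # assumed numerator
    return total

def temporal_part_gauss(x0, prev_pixels, k_t1, a_t1):
    """Temporal term in the form of equation (7)."""
    return sum((x0 - xp) / (k_t1 * (xp - x0) ** 2 + a_t1)
               for xp in prev_pixels)

x0 = 110.0
prev = [100.0] * 9                   # nine positions in the previous image
pairs = [(xp, 100.0) for xp in prev]

# With k_t3 = 0, the unified form reduces to the Gaussian form.
f_unified = temporal_part(x0, pairs, k_t2=0.2, k_t3=0.0, a_t2=20.0)
f_gauss = temporal_part_gauss(x0, prev, k_t1=0.1, a_t1=20.0)
y0 = x0 - f_unified                  # spatial part omitted in this sketch
```

Because of the division by two in the denominator of the combined
form, the two results coincide when k_t1 equals k_t2 / 2 and the
additive constants are equal, which is the case for the values chosen
here.
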

[0024] The rational filter 310/311 is enabled if the value of
the kurtosis k of z is lower than 15; otherwise, the median filter
312 is enabled. If the kurtosis k is lower than 6, the first
temporal part (for the Gaussian noise) is enabled. If the kurtosis
k is between 6 and 15, the second temporal part (for the
contaminated Gaussian noise) is enabled.
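
The hard-switching rule may be expressed compactly as in the following
Python sketch. The enumeration `FilterChoice` and the function name
`select_filter` are hypothetical; the thresholds 6 and 15 are those
given in the description.

```python
from enum import Enum

class FilterChoice(Enum):
    RATIONAL_GAUSSIAN = "rational filter, first temporal part"
    RATIONAL_CONTAMINATED = "rational filter, second temporal part"
    MEDIAN = "median filter 312"

def select_filter(k, low=6.0, high=15.0):
    """Map a kurtosis estimate k to the filter to enable (hard switching)."""
    if k < low:
        return FilterChoice.RATIONAL_GAUSSIAN
    if k < high:
        return FilterChoice.RATIONAL_CONTAMINATED
    return FilterChoice.MEDIAN

choice = select_filter(3.1)          # e.g., a kurtosis estimate near 3
```

A kurtosis estimate near 3 thus selects the first temporal part of the
rational filter, while any value of 15 or more selects the median
filter.
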

[0025] In order to treat long-tailed noise effectively, the
filter 312 is preferably a simple median filter. In general, a
median filter is based on order statistics. A two-dimensional
median filter is given by:

    $$y_0 = \mathrm{median}\{x_i, x_0, x_j\} \qquad (9)$$

The set $x_i$, $x_j$ defines a neighborhood of the central pixel
$x_0$ and is called a filter mask. The median filter replaces the
value of the central pixel by the median of the values of the
pixels in the filter mask. A simple, appropriate mask is a
5-element X-shaped mask. Such a filter is known from Ref. [3]. In
case of the 5-element X-shaped filter, the filter mask includes the
central pixel $x_0$ and the pixels diagonally adjacent to the
central pixel $x_0$. These spatial directions are indicated in
Figs. 2C and 2D.

[0026] Preferably, both ideal impulsive noise (single noisy
pixels) and real-world impulsive-like noise (e.g., present in
satellite receivers), made of horizontal one-pixel-wide strips
rather than single noisy pixels, are removed. Both types of noise
affect only one pixel out of 5 in the X-shaped mask, so that the
noisy element is easily removed by the median operator. It should
be noted that one-pixel-wide vertical strips, which may be found in
video obtained from motion picture films, can also be effectively
removed by this filter. To remove wider strips, a larger support is
required. Once the impulsive noise type has been detected, the
simple median is used.
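
The effect of the 5-element X-shaped median on a one-pixel-wide strip
may be illustrated by the following Python sketch. The test image, the
border handling (border pixels are simply copied) and the function
names are assumptions made for this illustration.

```python
# Centre pixel plus its four diagonal neighbours (cf. Figs. 2C and 2D).
X_OFFSETS = ((0, 0), (-1, -1), (-1, 1), (1, -1), (1, 1))

def x_median(img, r, c):
    """Median of the centre pixel and its four diagonal neighbours."""
    values = sorted(img[r + dr][c + dc] for dr, dc in X_OFFSETS)
    return values[2]                      # middle of 5 sorted values

def filter_image(img):
    """Apply the X-shaped median to interior pixels; borders are copied."""
    out = [row[:] for row in img]
    for r in range(1, len(img) - 1):
        for c in range(1, len(img[0]) - 1):
            out[r][c] = x_median(img, r, c)
    return out

streaked = [[100] * 7 for _ in range(7)]
streaked[3] = [255] * 7                   # horizontal one-pixel-wide strip
cleaned = filter_image(streaked)
```

Within the strip, only the centre pixel of each mask is corrupted, so
the median returns the background value; in the rows directly above
and below the strip, two of the five mask pixels are corrupted, which
still leaves the median at the background value.
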

[0027] The noise discriminator 30 controls the set of filters
31. Although hard switching is used in the above-described
embodiments, soft switching is also possible, e.g., enabling the
most suitable filter of the set of filters 31 by more than 50% and,
in addition, partly enabling one or more of the other filters in
the set of filters 31. In an exemplary case in which the signal
includes mostly Gaussian noise, the filter 310 may be enabled for
80% and the other two filters 311 and 312 for 10% each. The claims
should be construed as comprising such a soft switching
implementation too.
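
One conceivable reading of soft switching is a weighted sum of the
outputs of the individual filters, as in the Python sketch below. This
interpretation, the function name `soft_switch` and the example pixel
values are assumptions; the description itself only specifies the
percentages.

```python
def soft_switch(outputs, weights):
    """Blend per-filter outputs for one pixel with weights that sum to 1."""
    if len(outputs) != len(weights):
        raise ValueError("one weight per filter output is required")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return sum(w * y for w, y in zip(weights, outputs))

# Hypothetical outputs of filters 310, 311 and 312 for the same pixel.
y_310, y_311, y_312 = 100.0, 110.0, 120.0

# Filter 310 enabled for 80%, filters 311 and 312 for 10% each.
y_soft = soft_switch((y_310, y_311, y_312), (0.8, 0.1, 0.1))
```

Hard switching corresponds to the special case in which one weight is
1 and the others are 0.
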

[0028] Depending on the application or the image sequence, other
filters or a different noise discriminator may be used. The basic
idea of the invention is to use at least two filters, designed for
different types of noise, and a noise discriminator for enabling
the most suitable filter of the at least two filters. The invention
is also applicable to other signals, e.g., audio.

[0029] Motion-compensation-based algorithms generally provide
better performance at the cost of a much more complex structure.
Motion-compensation-based algorithms are preferably applied in
professional embodiments of the invention.

[0030] It should be noted that the above-mentioned embodiments
illustrate rather than limit the invention, and that those skilled
in the art will be able to design many alternative embodiments
without departing from the scope of the appended claims. The word
'image' also refers to picture, frame, field, etc. In the claims,
any reference signs placed between parentheses shall not be
construed as limiting the claim. The word 'comprising' does not
exclude the presence of other elements or steps than those listed
in a claim. The invention can be implemented by means of hardware
comprising several distinct elements, and by means of a suitably
programmed computer. In a device claim enumerating several means,
several of these means can be embodied by one and the same item of
hardware. The mere fact that certain measures are recited in
mutually different dependent claims does not indicate that a
combination of these measures cannot be used to advantage.

[0031] In summary, the invention provides noise filtering of a
signal by estimating a type of noise in the signal and enabling one
of at least two noise filters, the enabled noise filter being a
most suitable filter for the estimated type of noise. An
approximation of the noise in the signal is obtained by computing a
difference between the signal and a noise-filtered version of the
signal. The invention uses a kurtosis of the noise as a metric for
estimating the type of noise. If the estimated type of noise is
long-tailed noise, a median filter is enabled to filter the signal.
If the estimated type of noise is Gaussian noise or contaminated
Gaussian noise, a spatio-temporal filter is enabled to filter the
signal. The invention may be applied in a video system with a
camera and a noise filter.
